SUMMARY

Let $\pi_1, \ldots, \pi_k$ be k given populations. Assume that we wish to find a population better than a given control, if there is any. From all populations we may draw independent samples with distributions which are (at least partly) determined by real parameters $\theta_1, \ldots, \theta_k$, say. A population $\pi_i$ is viewed to be better than the control if $\theta_i > \theta_0$, $i = 1, \ldots, k$, where $\theta_0 \in \mathbb{R}$ is a fixed given constant. The goal is to guarantee at least a probability $P^*$ of making a correct decision if $\theta_i \le \theta_0$, $i = 1, \ldots, k$, and to maximize the probability of finding a population better than $\theta_0$, otherwise.

Two-stage procedures of the following type will be studied: At Stage 1, based on samples $X_1, \ldots, X_k$, all populations are screened out which appear to be no better than $\theta_0$. If none (exactly one) is left the procedure stops and decides that none (this one) is better than $\theta_0$. If more than one, $\pi_i$ with $i \in s$, survive then one proceeds to Stage 2. Here additional samples $Y_i$, $i \in s$, are drawn and the final decision is made based on $Y_i$ or $(X_i, Y_i)$, $i \in s$.

A natural class of two-stage procedures is proposed which can be completely described and studied in terms of Neyman-Pearson testing theory, where the asymmetry of tests, however, can be overcome to a considerable extent. As a typical result it is shown that optimality of tests carries over to optimality of two-stage procedures. Finally, under normality, comparisons are made in the case of k = 2 with certain Bayesian procedures.




1.  Introduction

If k populations are given and we wish to decide, on the basis of a properly chosen sampling scheme, which one of these populations is the best one (e.g. has the largest mean), various different approaches and methods have been studied up to now. A complete overview is provided by Gupta and Panchapakesan (1979). Among those, two-stage procedures with screening in the first stage seem to be quite appropriate, since they are more economical than one-stage procedures but still technically not as complicated as sequential ones. Nevertheless, optimality results are missing here up to now and obviously are hard to find. Even the implementation of a simple procedure (such as the one which uses Gupta's (1965) maximum means procedure in the first stage and the natural final decision in the second stage) in an indifference zone approach under the assumption of normality with a common known variance causes considerable difficulties. For details and references see Tamhane and Bechhofer (1979), Gupta and Miescke (1979) and Miescke and Sehr (1980).

The situation becomes somewhat more favorable if we wish to find a population better than a control $\pi_0$, whether it is known or unknown. This is because pairwise comparisons are to be made now between $\pi_i$ and $\pi_0$ instead of $\pi_i$ and $\pi_j$, $i \ne j$, $i, j \in \{1, \ldots, k\}$. Let us admit here additionally the choice of a final decision "none of the populations is better than the control". Moreover, let us adopt the following basic condition and goal:

P*-Condition: Let $P^* \in (0,1)$ be a predetermined constant. A procedure is said to meet the P*-condition if its probability of making the final decision "none of the populations is better than the control" is at least $P^*$ whenever this decision is correct.


Goal: Among all procedures (in a given class) which meet the P*-condition find that procedure which maximizes the probability of deciding finally in favor of a population better than the control if there is any.

The purpose of this paper is to show that a natural class of two-stage procedures, being widely used in practice, can be described and studied within the framework of Neyman-Pearson testing theory, where the asymmetry of tests can be overcome to a considerable extent. Emphasis is laid on the basic structure and on comparisons of such procedures rather than on establishing specific ones.

In Section 2 we introduce a natural class $\mathcal{A}$ of two-stage procedures and derive a formula for their probabilities of correct decisions. As a typical consequence it will be demonstrated in Section 3 that two-stage procedures based on good unbiased one-sample tests for $H_i$: "population $\pi_i$ is better than $\pi_0$" versus $K_i$: "population $\pi_i$ is inferior with respect to $\pi_0$", which are simultaneously good tests for the dual problem (where $H_i$ and $K_i$ are interchanged), perform well. Three open questions conclude this section. Finally, in Section 4 we study the normal case and make some comparisons with certain Bayesian procedures in the case of k = 2 populations.


2.  Basic  results 

Suppose that for every $\pi_i$, $i = 1, \ldots, k$, we have a family $\{f_{i,\theta}\}_{\theta \in \Omega}$ of densities with respect to the Lebesgue measure or any counting measure on $\mathbb{R}$ which have a common support $Q_i \subseteq \mathbb{R}$ and may be known or only partly known. The assumption concerning the supports is made for convenience to make ideas clearer and can be weakened in certain circumstances. Let the fixed known control be denoted by $\theta_0 \in \Omega$, and let a population $\pi_i$ be called better than the control if $\theta_i > \theta_0$ and inferior to it if $\theta_i < \theta_0$. Let $X_i = (X_{i1}, \ldots, X_{in_i})$ and $Y_i = (Y_{i1}, \ldots, Y_{im_i})$, $i = 1, \ldots, k$, be samples from $\pi_i$ available at Stage 1 and Stage 2, respectively, where $X_1, Y_1, \ldots, X_k, Y_k$ are mutually independent, and let $X = (X_1, \ldots, X_k)$ and $Y = (Y_1, \ldots, Y_k)$.

Before we define a natural class of two-stage procedures in a concise way, let us briefly describe how these procedures typically are applied in practice. For every testing problem $K_i$: "$\pi_i$ is inferior to the control" versus $H_i$: "$\pi_i$ is better than the control" the experimenter chooses a test based on $X_i$, and for $H_i$ versus $K_i$ another one based on $Y_i$ or $(X_i, Y_i)$, $i = 1, \ldots, k$, takes two levels $\alpha_1, \alpha_2 \in (0,1)$ (which usually are small) and proceeds as follows:

At Stage 1 he discards all populations which are not significant at level $\alpha_1$ under the first set of tests. If none (exactly one) is left, he decides that none (this one) is better than the control. Only if more than one population survives does he proceed to Stage 2.

At Stage 2 the experimenter draws additional samples $Y_i$ from those $\pi_i$'s which were selected at Stage 1 and exchanges hypotheses and alternatives with respect to these populations. If all these populations now turn out to be significant at level $\alpha_2$ under the second set of tests (which is rather unlikely to happen) he decides that none of the populations is better than the control. Otherwise, he makes a final decision in favor of that population among the selected ones which has the largest p-value under the associated second test.

If these tests are upper level $\alpha_1$ (respectively lower level $\alpha_2$) tests, which for simplicity are non-randomized for a moment to fix ideas, based on real valued statistics $U_i$ and $V_i$, $i = 1, \ldots, k$, then the procedure described above can be equivalently described as follows: At Stage 1 all $\pi_i$'s are selected with $U_i > c_i$ (where $c_i$ is the $\alpha_1$-fractile of $U_i$ under $\theta_i = \theta_0$), and the final decision is made in terms of the largest $V_i$ among the selected $\pi_i$'s, provided $V_i > d_i$ (where $d_i$ is the $\alpha_2$-quantile of $V_i$ under $\theta_i = \theta_0$). The truncated versions of such procedures (i.e. those which perform Stage 1 only) have been studied by several authors. See, for example, Gupta and Sobel (1958) and Lehmann (1961). Also some work has been done in sequential setups. For references see Gupta and Panchapakesan (1979), Chapter 20. But, apparently, no results concerning two-stage procedures of the type described above have appeared in the literature until now. This gives us the motivation for the following considerations.

To begin with, let us state without a formal proof that by similar arguments as are used in Miescke (1979a) it can be shown that every procedure of the type described just now - where $U_i$, $c_i$ and $V_i$, $d_i$ more generally may take values in measurable spaces $\mathcal{X}_i$ and $\tilde{\mathcal{X}}_i$, and where $U_i$ and $V_i$ are stochastically non-decreasing in $\theta_i$ with respect to measurable total orderings in $\mathcal{X}_i$ and $\tilde{\mathcal{X}}_i$, $i = 1, \ldots, k$ - is a member of the class $\mathcal{A}$ to be defined below.

To avoid confusion and to arrive at a consistent representation of this class let us from now on use only tests for $H_i$ versus $K_i$, $i = 1, \ldots, k$, which take value 1 as soon as one observation falls outside the support $Q_i$. (This modifies procedures only on null-sets.) Finally, several definitions given in Miescke (1979a) will be relevant in the sequel but for brevity are not repeated here. Especially, tests may be randomized ones taking values in [0,1]. This typically occurs in discrete cases or in continuous cases when nonparametric (rank) tests are under concern. Thus significance statements as well as p-values are understood to be based on additional randomization schemes as are used in Miescke (1979a). To be more specific let $A = (A_1, \ldots, A_k)$ be that one for the first stage and $B = (B_1, \ldots, B_k)$ that one for the second stage. Note that X, Y, A and B are assumed to be independent.

The class $\mathcal{A}$ of two-stage procedures:

For $i = 1, \ldots, k$ let

(1) $\varphi_i = \{\varphi_{i,\alpha}\}_{\alpha \in [0,1]}$ be a right-continuous and monotone (in $\alpha$) unbiased test for $H_i$ versus $K_i$ based on $X_i$, which is standardized at $\theta_0$. Assume that within $Q_i$, $\varphi_{i,0} \equiv 0$ and $\varphi_{i,1} \equiv 1$. Let $\varphi = (\varphi_1, \ldots, \varphi_k)$.

(2) Analogously, let $\psi_i = \{\psi_{i,\alpha}\}_{\alpha \in [0,1]}$ be such a test for $H_i$ versus $K_i$ based on $(X_i, Y_i)$. Let $\psi = (\psi_1, \ldots, \psi_k)$.

For $0 < \alpha_1 \le 1$ and $0 \le \alpha_2 < 1$ let $(\varphi, 1-\alpha_1, \psi, \alpha_2)$ denote the following two-stage procedure:

Stage 1: Select $\pi_i$ if $p_{\varphi_i}(X_i, A_i)$, the p-value of $X_i$ under $\varphi_i$, is larger than $1-\alpha_1$, $i = 1, \ldots, k$. If none (exactly one) of the populations is selected, stop and decide "none (this one) is better than $\theta_0$". Otherwise proceed to Stage 2.

Stage 2: Among the selected populations decide finally in favor of the $\pi_i$ which has the largest p-value $p_{\psi_i}(X_i, Y_i, B_i)$, provided it is no smaller than $\alpha_2$. Otherwise decide that "none is better than $\theta_0$".

Let $\mathcal{A}$ be the set of all such two-stage procedures.
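The decision logic of a procedure of this type can be sketched in a few lines of Python. This is only an illustration: the function name `decide_from_p_values` and the convention of passing already computed p-values as plain lists are assumptions made here, and the randomization schemes A and B are taken to be absorbed into the p-values.

```python
from typing import Optional, Sequence


def decide_from_p_values(p_stage1: Sequence[float],
                         p_stage2: Sequence[float],
                         alpha1: float,
                         alpha2: float) -> Optional[int]:
    """Return the index of the chosen population, or None for 'none is better'.

    p_stage1[i] is the Stage 1 p-value of population i; p_stage2[i] is its
    Stage 2 p-value and is only consulted for populations surviving Stage 1.
    """
    # Stage 1: select population i when its p-value is larger than 1 - alpha1.
    selected = [i for i, p in enumerate(p_stage1) if p > 1.0 - alpha1]
    if not selected:
        return None            # nobody survives: "none is better"
    if len(selected) == 1:
        return selected[0]     # exactly one survives: decide for this one
    # Stage 2: take the survivor with the largest second-stage p-value,
    # provided that p-value is no smaller than alpha2.
    best = max(selected, key=lambda i: p_stage2[i])
    return best if p_stage2[best] >= alpha2 else None
```

For instance, with `alpha1 = 0.05` and `alpha2 = 0.01`, Stage 1 p-values `[0.99, 0.97]` send both populations to Stage 2, where the one with the larger second-stage p-value is chosen unless that value falls below 0.01.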

The following result will serve as our basic tool to determine $(\alpha_1, \alpha_2)$-tuples for meeting the P*-condition as well as to compare the performance of competing procedures satisfying the P*-condition.


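Theorem 1. Let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}$. For notational convenience let $E_i = E_\theta\, \varphi_{i,1-\alpha_1}(X_i)$ and $F_i(\alpha) = E_\theta\{\psi_{i,\alpha}(X_i, Y_i) \mid X_i \text{ is not significant under } \varphi_{i,1-\alpha_1}\}$, $i = 1, \ldots, k$. Then for every non-empty $D \subseteq \{1, \ldots, k\}$ and $\theta \in \Omega^k$,

(2.1)  $P_\theta\{\text{final decision of } (\varphi, 1-\alpha_1, \psi, \alpha_2) \text{ falls into } D\} = \int_{\alpha_2}^{1} \prod_{j \notin D} [E_j + (1-E_j)F_j(\alpha)] \; d\Big(\prod_{i \in D} [E_i + (1-E_i)F_i(\alpha)]\Big)$,

where integration is with respect to $\alpha$. Moreover,

(2.2)  $P_\theta\{\text{final decision of } (\varphi, 1-\alpha_1, \psi, \alpha_2) \text{ does not fall into } \{1, \ldots, k\}\} = \prod_{j=1}^{k} [E_j + (1-E_j)F_j(\alpha_2)]$.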
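A small numerical sketch may help to read these two formulas. It assumes that (2.1) and (2.2) are built from the functions $G_i(\alpha) = E_i + (1-E_i)F_i(\alpha)$, with (2.2) equal to the product of the $G_j(\alpha_2)$. The helper names `prob_final_in_D` and `prob_none`, and the choice of a midpoint Riemann-Stieltjes sum, are illustrative assumptions and not part of the paper.

```python
import math
from typing import Callable, Sequence


def prob_final_in_D(E: Sequence[float],
                    F: Sequence[Callable[[float], float]],
                    D: Sequence[int],
                    alpha2: float,
                    steps: int = 20000) -> float:
    """Approximate the integral in (2.1) by a Riemann-Stieltjes sum."""
    k = len(E)
    inside = set(D)

    def G(i: int, a: float) -> float:
        return E[i] + (1.0 - E[i]) * F[i](a)

    def prod_in(a: float) -> float:    # product over i in D (the integrator)
        return math.prod(G(i, a) for i in inside)

    def prod_out(a: float) -> float:   # product over j not in D (the integrand)
        return math.prod(G(j, a) for j in range(k) if j not in inside)

    h = (1.0 - alpha2) / steps
    total = 0.0
    for t in range(steps):
        lo = alpha2 + t * h
        hi = lo + h
        total += prod_out(0.5 * (lo + hi)) * (prod_in(hi) - prod_in(lo))
    return total


def prob_none(E: Sequence[float],
              F: Sequence[Callable[[float], float]],
              alpha2: float) -> float:
    """Product form assumed here for (2.2)."""
    return math.prod(E[j] + (1.0 - E[j]) * F[j](alpha2) for j in range(len(E)))
```

As a consistency check under these assumptions, the probabilities of the k single-population decisions and the probability of "none" add up to one, because the integrands sum to the differential of the full product.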


Proof: It is shown in Miescke (1979a) (cf. (2.3) there) that the distribution function of each p-value appearing in $(\varphi, 1-\alpha_1, \psi, \alpha_2)$ equals the power function of the corresponding test, which hereby is thought of as being a function of $\alpha \in [0,1]$ where the parameter $\theta_i$ on the other hand is held fixed.

Let $D \subseteq \{1, \ldots, k\}$, $D \ne \emptyset$, $\theta \in \Omega^k$, $0 < \alpha_1 \le 1$ and $0 \le \alpha_2 < 1$ be fixed. Then

$P_\theta\{\text{final decision falls into } D\}$

$= \sum_{r \in D} P_\theta\{\text{final decision is in favor of } \pi_r\}$

$= \sum_{r \in D} \sum_{s:\, r \in s} P_\theta\{\text{the } \pi_i\text{'s with } i \in s \text{ are selected and final decision is made in favor of } \pi_r\}$

$= \sum_{r \in D} \sum_{s:\, r \in s} \int_{\alpha_2}^{1} \prod_{i \in s,\, i \ne r} F_i(\alpha)\, dF_r(\alpha) \prod_{i \in s} (1-E_i) \prod_{j \notin s} E_j$

$= \sum_{r \in D} \int_{\alpha_2}^{1} \Big[ \sum_{s:\, r \in s} \prod_{i \in s,\, i \ne r} (1-E_i)F_i(\alpha) \prod_{j \notin s} E_j \Big] (1-E_r)\, dF_r(\alpha)$.

Now, the integrand $[\,\cdots]$ equals $\prod_{i \ne r} [E_i + (1-E_i)F_i(\alpha)]$, and $(1-E_r)\,dF_r(\alpha)$ can be replaced by $d[E_r + (1-E_r)F_r(\alpha)]$. Thus $P_\theta\{\text{final decision falls into } D\}$ equals

$\sum_{r \in D} \int_{\alpha_2}^{1} \prod_{i \ne r} [E_i + (1-E_i)F_i(\alpha)] \; d[E_r + (1-E_r)F_r(\alpha)]$

$= \int_{\alpha_2}^{1} \prod_{j \notin D} [E_j + (1-E_j)F_j(\alpha)] \; d\Big(\prod_{i \in D} [E_i + (1-E_i)F_i(\alpha)]\Big)$.

This completes the proof of (2.1). Since (2.2) can be verified by similar arguments its proof is omitted for brevity.


Remark: Note that for $i = 1, \ldots, k$ we have also the following representation of $E_i + (1-E_i)F_i(\alpha)$, $\alpha \in [0,1]$:

(2.3)  $E_i + (1-E_i)F_i(\alpha) = E_\theta\{\varphi_{i,1-\alpha_1}(X_i) + (1 - \varphi_{i,1-\alpha_1}(X_i))\, \psi_{i,\alpha}(X_i, Y_i)\}$.

Corollary 1. Every two-stage procedure $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}$ satisfies the P*-condition if

(2.4)  $(1-\alpha_1)^k \ge P^*$.

Proof: A lower bound for (2.2) is $\prod_{j=1}^{k} E_j$, which satisfies

$\prod_{j=1}^{k} E_j = \prod_{j=1}^{k} E_\theta\, \varphi_{j,1-\alpha_1}(X_j) \ge (1-\alpha_1)^k$  if $\theta_i \le \theta_0$, $i = 1, \ldots, k$.

This follows from the unbiasedness of the tests.

Unfortunately, the dependencies between $\varphi_i(X_i)$ and $\psi_i(X_i, Y_i)$, $i = 1, \ldots, k$, make it hard to find good procedures in $\mathcal{A}$. Therefore most of our results in the sequel will be given only with respect to $\mathcal{A}'$, say, which consists of all procedures from $\mathcal{A}$ where the tests $\psi_1, \ldots, \psi_k$ in the second stage depend only on the $Y_i$'s and not on the $X_i$'s, $i = 1, \ldots, k$. The interested reader is invited to try to prove one of the conjectures stated at the end of this section.

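Corollary 1'. A two-stage procedure $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}'$ satisfies the P*-condition if and only if

(2.5)  $(1 - \alpha_1 + \alpha_1\alpha_2)^k \ge P^*$.

Proof: Let $\theta \in \Omega^k$ with $\theta_i \le \theta_0$, $i = 1, \ldots, k$. For $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}'$, (2.2) reduces to

$\prod_{j=1}^{k} \big[ E_\theta\, \varphi_{j,1-\alpha_1}(X_j) + (1 - E_\theta\, \varphi_{j,1-\alpha_1}(X_j))\, E_\theta\, \psi_{j,\alpha_2}(Y_j) \big]$,

which, by the unbiasedness of the tests and the fact that $a + (1-a)b = b + (1-b)a$ is increasing in $a, b \in [0,1]$, assumes its lowest value at $\theta_1 = \ldots = \theta_k = \theta_0$, where the power functions are equal to the levels of the tests.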

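Remark: If in a procedure $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}$ every pair $\varphi_{i,1-\alpha_1}(X_i)$, $\psi_{i,\alpha_2}(X_i, Y_i)$ has a non-negative correlation for $\theta_i \le \theta_0$, $i = 1, \ldots, k$ (which, in general, is given if the procedure is reasonable), then the infimum of (2.2) falls between $(1-\alpha_1)^k$ and $(1 - \alpha_1 + \alpha_1\alpha_2)^k$.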

Let us from now on adopt the following convention:

Convention: All procedures from $\mathcal{A}'$ are assumed to have an $\alpha_1$ satisfying $(1-\alpha_1)^k = P^*$ and a small $\alpha_2$.

In view of (2.5), $(1-\alpha_1)^k = P^*$ and $\alpha_2 = 0$ clearly is the best choice to minimize the expected overall sampling amount (and to make (2.4) an exact condition with respect to $\mathcal{A}$). But on the other hand an experimenter might feel restricted at not being permitted to decide also at Stage 2 against all populations. Thus let us admit at least a small $\alpha_2$. This is slightly conservative with respect to (2.5). But it changes the probabilities of any events at most by a difference of $(\max\{\alpha_1, \alpha_2\})^2$. This follows from the fact that $\alpha_2$ acts only on probabilities of events where at least two populations pass Stage 1 and eventually are rejected at Stage 2. To give a numerical example, take $\alpha_1 = \alpha_2 = 10^{-2}$. Then for k = 5 (10) we have a $P^*$ above 0.95 (0.90), and $\alpha_2$ changes all probabilities at most by the amount of $10^{-4}$.
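The arithmetic behind this numerical example can be checked with a short sketch. The function names below are hypothetical, and the upper bound $(1-\alpha_1+\alpha_1\alpha_2)^k$ is used in the form assumed for the Remark and for (2.5) above.

```python
def p_star_bounds(k: int, alpha1: float, alpha2: float) -> tuple:
    """Return ((1 - alpha1)^k, (1 - alpha1 + alpha1 * alpha2)^k)."""
    lower = (1.0 - alpha1) ** k
    upper = (1.0 - alpha1 + alpha1 * alpha2) ** k
    return lower, upper


def alpha1_for_p_star(k: int, p_star: float) -> float:
    """Solve (1 - alpha1)^k = P* for alpha1, as in the Convention."""
    return 1.0 - p_star ** (1.0 / k)


if __name__ == "__main__":
    for k in (5, 10):
        lower, upper = p_star_bounds(k, 1e-2, 1e-2)
        print(k, round(lower, 4), round(upper, 4))
    print(alpha1_for_p_star(5, 0.95))
```

With $\alpha_1 = 10^{-2}$ the lower bound $(1-\alpha_1)^k$ stays above 0.95 for k = 5 and above 0.90 for k = 10, which is what the example in the text states.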


3.  Consequences and extensions

The following two results will be used repeatedly in the sequel. The proofs are straightforward using integrations by parts and are therefore omitted for brevity.


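Lemma 1. Let $G_i, \tilde G_i : [0,1] \to [0,1]$ be right-continuous, non-decreasing with $G_i(1) = \tilde G_i(1) = 1$ and $G_i(\alpha) \le (\ge)\ \tilde G_i(\alpha)$ for $i = 1, \ldots, r$ $(i = r+1, \ldots, k)$, $r \in \{1, \ldots, k\}$. Then for $0 \le \alpha_2 < 1$

(3.1)  $\int_{\alpha_2}^{1} \prod_{i=1}^{r} G_i(\alpha)\; d\Big(\prod_{j=r+1}^{k} G_j(\alpha)\Big) \le \int_{\alpha_2}^{1} \prod_{i=1}^{r} \tilde G_i(\alpha)\; d\Big(\prod_{j=r+1}^{k} \tilde G_j(\alpha)\Big)$.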
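As a special case of Lemma 1 we get

Corollary 2. Let $G_1, \ldots, G_k$ be as before and $G_i(\alpha) \le G_j(\alpha)$, $\alpha \in [0,1]$, for a pair $i, j \in \{1, \ldots, k\}$ with $i \ne j$. Then for $0 \le \alpha_2 < 1$

(3.2)  $\int_{\alpha_2}^{1} \prod_{l \ne i} G_l(\alpha)\, dG_i(\alpha) \ge \int_{\alpha_2}^{1} \prod_{l \ne j} G_l(\alpha)\, dG_j(\alpha)$.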

For the sequel let $P_\theta\{C.D.\}$ denote the probability of a correct decision at $\theta \in \Omega^k$, i.e. that the final decision falls into $R(\theta) = \{i \mid \theta_i > \theta_0,\ i = 1, \ldots, k\}$ if $R(\theta)$ is non-empty, or that the final decision is "no population is better than $\pi_0$" if $R(\theta)$ is empty.

Corollary 3. Let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}'$, $0 < \alpha_1 \le 1$ and $0 \le \alpha_2 < 1$. If for every $i \in \{1, \ldots, k\}$, $\varphi_i$ and $\psi_i$ are UMP unbiased tests for $H_i$ versus $K_i$ based on $X_i$ and $Y_i$, and if simultaneously $1-\varphi_i$ and $1-\psi_i$ are UMP unbiased tests for the dual testing problem (where the hypothesis and the alternative are interchanged), then at every $\theta \in \Omega^k$, $(\varphi, 1-\alpha_1, \psi, \alpha_2)$ has the largest $P_\theta\{C.D.\}$ and the smallest expected sampling amount among all $(\tilde\varphi, 1-\alpha_1, \tilde\psi, \alpha_2) \in \mathcal{A}'$.

As is well known, these conditions are usually fulfilled in one-parameter MLR and multiparameter exponential family situations. The proof of Corollary 3 as well as that of the next result follows from (2.1) and Lemma 1.

Corollary 4. Let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}'$. If the power functions of all tests are non-decreasing (non-increasing) in sample sizes for $\theta_i < (>)\ \theta_0$, then $P_\theta\{C.D.\}$ is non-decreasing in sample sizes at every $\theta \in \Omega^k$.

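The next result can be stated with respect to $\mathcal{A}$.

Corollary 5. Let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}$ where $\varphi$ consists of consistent tests. Then $P_\theta\{C.D.\}$ converges to 1 if $n_i \to \infty$ and $\theta_i \ne \theta_0$, $i = 1, \ldots, k$.

Proof: Let $\theta \in \Omega^k$ with $\theta_i \ne \theta_0$, $i = 1, \ldots, k$. Then

$P_\theta\{C.D.\} \ge P_\theta\{\text{only } \pi_i\text{'s with } i \in R(\theta) \text{ are selected at Stage 1}\} = \prod_{j \notin R(\theta)} E_\theta\, \varphi_{j,1-\alpha_1}(X_j) \prod_{i \in R(\theta)} [1 - E_\theta\, \varphi_{i,1-\alpha_1}(X_i)]$,

which tends to 1 for large $n_i$ by the consistency of the tests.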

Under the assumption of monotone (non-decreasing) likelihood ratios (MLR) a stronger result can be obtained.


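Theorem 2. Assume that in every population $\pi_i$ the family of densities has MLR, $i = 1, \ldots, k$. Let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}$ (or $\mathcal{A}'$) consist of the UMP tests for the corresponding testing problems. Then for increasing sample sizes $n_i$, $m_i$, $i = 1, \ldots, k$, $P_\theta\{\text{final decision in favor of that } \pi_i \text{ with the largest } \theta_i\}$ tends to 1 for all $\theta \in \Omega^k$ with $R(\theta) \ne \emptyset$, and $P_\theta\{\text{final decision is "no population is better than } \pi_0\text{"}\}$ tends to 1 for all $\theta \in \Omega^k$ with $\theta_1, \ldots, \theta_k < \theta_0$.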


Proof:  Let  n  ■  -•  •  ,,  0  ■  *  1  and  P  ».  •  I. 

k  o  I  1  **  i  1  / 

P  { ( s  J- , C'vt^)  finally  decides  i..  \ivu-'  >  f  ■  .  , 


flow,  P  {p  (X,  ,A,  )  •  1  -  • .  ; p  (X  , Y.  ,B  }  i.,  tends  to  i  tor  largo  n  .irg 

*J  k  I  -  k  -  K  L  .  k 

m^  hy  the  consistency  of  the  tests. 

Moreover',  the  procedure  winch  decides  if'  fuvor  of  a  rope :  ,tf  i  on  ,k  cod¬ 
ing  to  the  largest  p-valuo  with  rvspoct  to  i  it,  «  *  viewed,  »  go ;  l  et  ’  I  , 


as  he l ng  based  on 


Say,  whfM'e 


*  v  * 


now  standardized  at  (in.tea'J  of  i  But  then  since  al  1 

►  o ' 

populations  ■;  p  . .  .  ,  ■.  ^  j'o  shitted  info  “alternatives", 

1  k-l 

P(,{p  ^  p,  J/ki  ~  P  p  •  p  ,i/ki  -  /  II  tti  ( X  ■ , Y  ■  )d  i  , 

-  *'k  M  -  M-  '  Oil  i  ’  1 

(cf.  Miescke  (  i  3 / 9 * i ) )  which  tends  to  1  for  larue  im^  , . . .  ,ni^  .  When  more 

than one $\theta_i$ is largest a similar result can be derived. The proof for $\mathcal{A}'$ using UMP tests $\psi_1, \ldots, \psi_k$ based on $Y_1, \ldots, Y_k$ is exactly the same. The second assertion of the theorem is already proved by Corollary 5.

Remark: If the asymptotic relative efficiency (ARE) in the sense of Pitman is defined in terms of the probability of selecting the $\pi_i$'s with $i \in R(\theta)$ at Stage 1, then, if $\mathrm{ARE}(\varphi_i, \tilde\varphi_i)$ does not depend on $i \in \{1, \ldots, k\}$ (as is typically the case when $\varphi_1, \ldots, \varphi_k$ are of the same type and $\tilde\varphi_1, \ldots, \tilde\varphi_k$ are of the same type), we have $\mathrm{ARE}((\varphi, 1-\alpha_1, \psi, \alpha_2), (\tilde\varphi, 1-\alpha_1, \psi, \alpha_2)) = \mathrm{ARE}(\varphi_1, \tilde\varphi_1)$ for all $0 < \alpha_1 \le 1$, $0 \le \alpha_2 < 1$ and $\psi$. The proof is similar to that in Miescke (1979a). Of course, it would be more satisfactory to have an ARE-concept including both stages. This seems to be a difficult problem. In fact, Pitman's approach does not lead here to clear conclusions.

Corollary  b.  I  r 1 :  i  ;  .m  .  if  tru*  power  turn,  (ions  of  ■  .  and 

Vj  are  non-  i  ncrens  mu  m  .  »  .  .  i  .  ►  ,  t.  her,  t  .»r  s  <  1  ,  .  .  .  ,k  with 

r  '  s 

( 3 .  i )  -  final  dm  sr-n  ■  i*.  Mver  o*  ■  ■ 

I '  '  :  n.l  ’  dm  i  Mtii;  i  .  in  t  <i  vor  of 


Proof 


[hi  ,  to  |  love.  \  *  tvs 


!  I  .O'  i  1  0»  ol  1  ar  v  /. 


Final Remarks:

(1) The results so far derived hold analogously in cases where the control values may depend on $i \in \{1, \ldots, k\}$.

(2) The case of unknown controls can be treated analogously provided that control samples are drawn independently for each single test.

(3) Under the assumption of MLR let $(\varphi, 1-\alpha_1, \psi, \alpha_2) \in \mathcal{A}'$ consist of the UMP tests for the corresponding testing problems. Then

(3.4)  inf  ( P  {C.D. )  (n  Cuk  ,  R(o)  =  0  ,  -  inf  i  P  -C.I).  ,  •  <  .  h  k(  ,  /  0 


v  o’  0 ’ *  ■  ’  '  0 ' 

This  follows  from  Theorem  1  in  a  recently  published  paper  b>  Simons  J9;;o) 
(2.1)  and  Lemma  1 . 

(4) Let us conclude this section by stating the following three important questions that have not been settled up to now: Assume that in all populations MLR is given and that $\varphi_i$, $\psi_i$, $\tilde\psi_i$ are the UMP tests based on $X_i$, $Y_i$, $(X_i, Y_i)$, $i = 1, \ldots, k$.

I) Is $(\varphi, 1-\alpha_1, \tilde\psi, \alpha_2)$ performing better than $(\varphi, 1-\alpha_1, \psi, \alpha_2)$?

II) If this is true, how well does $(\varphi, 1-\alpha_1, \tilde\psi, \alpha_2)$ perform in $\mathcal{A}$?

III) The one-stage procedures which decide according to the largest p-values of the corresponding tests, provided that they are larger than $1-\alpha_1$, are the natural competitors to $(\varphi, 1-\alpha_1, \psi, \alpha_2)$ and $(\varphi, 1-\alpha_1, \tilde\psi, \alpha_2)$. Do they need larger sampling amounts than the two-stage procedures take in the mean to have the same $P_\theta\{C.D.\}$?

4.  The normal case. Bayesian two-stage procedures for k = 2

Assume that we have k normal populations with means $\theta_1, \ldots, \theta_k$ and known variances $\sigma_1^2, \ldots, \sigma_k^2$. Let $\bar X_i$ and $\bar Y_i$ be the sample means derived from the samples of sizes $n_i$ and $m_i$ from $\pi_i$, $i = 1, \ldots, k$. Let $\Phi$ denote the cumulative distribution function of the standard normal distribution. Then the optimal procedure in $\mathcal{A}'$ is as follows:
Stage  1:  Select  all  population*  ,  1  with 


US:  -  ;  i  > 


'  *  i  ■  1 . k  ■  . 


If  S  -  ^ . j  1  ( c"  vV  sfoo,  and  decide  in  favor  of  ■  ("none  is  bolter 

J° 

than  l,0").  Otherwise,  proceed  to 

Stage  2:  Dec  id**  tm.iily  ir  Uvor  el  y  if  rCS  with  ?  *  Yc  ,  s  <  S  and 

V  ■  ^  •  m  *  V  ) .  otherwise  decide  that  no  population  is  hettf 

y  o  r  r 

than  the  control. 
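A minimal simulation sketch of a two-stage procedure of this kind under normality with known variances is given below. The thresholds are an assumption made for illustration: they are derived from the p-value rules of Section 2 (select at Stage 1 when the standardized Stage 1 mean exceeds the $(1-\alpha_1)$-quantile of the standard normal; accept the best survivor at Stage 2 when its standardized Stage 2 mean is at least the $\alpha_2$-quantile) and are not copied from the display above. The function names are hypothetical, and Stage 2 means are passed for all populations only to keep the interface simple.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()


def normal_two_stage(xbar, ybar, n, m, sigma, theta0, alpha1, alpha2):
    """Return the index of the chosen population, or None for 'none is better'.

    Assumes 0 < alpha1 < 1 and 0 <= alpha2 < 1.
    """
    k = len(xbar)
    c1 = STD_NORMAL.inv_cdf(1.0 - alpha1)
    # Stage 1: keep populations whose standardized mean exceeds c1.
    selected = [i for i in range(k)
                if math.sqrt(n[i]) * (xbar[i] - theta0) / sigma[i] > c1]
    if not selected:
        return None
    if len(selected) == 1:
        return selected[0]
    # Stage 2: standardized second-stage means of the survivors only.
    v = {i: math.sqrt(m[i]) * (ybar[i] - theta0) / sigma[i] for i in selected}
    best = max(selected, key=v.get)
    d = STD_NORMAL.inv_cdf(alpha2) if alpha2 > 0 else -math.inf
    return best if v[best] >= d else None


def estimate_none_probability(k, n, m, theta0, alpha1, alpha2,
                              reps=20000, seed=0):
    """Monte Carlo estimate of P{'none'} when all means equal theta0, sigma = 1."""
    rng = random.Random(seed)
    none_count = 0
    for _ in range(reps):
        xbar = [rng.gauss(theta0, 1.0 / math.sqrt(n)) for _ in range(k)]
        ybar = [rng.gauss(theta0, 1.0 / math.sqrt(m)) for _ in range(k)]
        decision = normal_two_stage(xbar, ybar, [n] * k, [m] * k, [1.0] * k,
                                    theta0, alpha1, alpha2)
        if decision is None:
            none_count += 1
    return none_count / reps
```

Under these assumptions, `estimate_none_probability(5, 10, 10, 0.0, 0.01, 0.01)` should come out close to $(1-\alpha_1)^k \approx 0.951$, in line with the P*-condition discussed in Section 2.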

Let  /.  (ri-On.)  ‘(n.\  * m  r),  i  be  the  overall  sample 

i  li  l  i  :  l 

Though  we  do  not  ►  r.uw  woe  M;er  Mu-  alternative  procedure  which  uses  /.'s 
instead  of  Y.'\  in  htage  .  per  fir-":  better,  we  ran  at.  least  show  that  in 
(2.1)  the  f,jf  if  i',».  .  r  t  f  [  :  . ,  1] ,  with  *•  *  will  t  he  I  I  he 

replaced  t>y  sma !!•’»•  ■  unU  U» .. .  •  i  [ 


-  *  f  2  '  M  *  '•<*  v\  i  1  , .  .  .  M  . 


Let  ••  *  . 

1  o 


I  ;  . 


♦  ( n  4  in  .  )  *  •  ■  .  ;  i  i 

oil  ) 


,  /  I  •  /  .  4  (  f)  •  On  )  ’ 

I  .  1  o  1  1  l 

I 


This  t  n  1  1  ow  :  » *»;  i  '  I  «s 
1  y  corre  1  a  ted.  :  i  n<?  ;  U  t  h» 


f*  ...i  •  lit;  i  "m 
i  1  '  n’Mp  1  e  :  (*.)  t  ■  / 


*  a ud  Y  are  ;•» i s  i  *  I  ve 

)  l 


P  •  /  -  *  M 

•  i  i  .i  i  • 

1 


(Ml  e  ^  • 


1  i  t  O 


m  *  .  y  -  ! 

ill  n 
When the variances are unknown the optimal procedure in $\mathcal{A}'$ is based on t-tests in an analogous way. Let $s_i^2$ and $\tilde s_i^2$ denote the usual minimum variance unbiased estimators of $\sigma_i^2$ based on $X_i$ and $Y_i$, respectively, $i = 1, \ldots, k$. Then $n_i^{-1/2}\sigma_i\Phi^{-1}(1-\alpha_1)$ at Stage 1 has to be replaced by $n_i^{-1/2} s_i\, t(n_i - 1, 1-\alpha_1)$, and $m_i^{-1/2}\sigma_i\Phi^{-1}(\alpha_2)$ at Stage 2 has to be replaced by $m_i^{-1/2} \tilde s_i\, t(m_i - 1, \alpha_2)$, where $t(n, \alpha)$ denotes the $\alpha$-quantile of the t-distribution with n degrees of freedom.

Though a (Bayesian) decision theoretic approach is quite difficult to perform in general, the case of k = 2 populations can at least be studied to some extent. A two-stage procedure will now be described by $S(X)$ (the random subset of $\{1,2\}$ of indices of those populations being selected at Stage 1) and $d(X,Y)$ (the final decision at Stage 2). As before, the procedure stops and decides 0, i.e. "none is better than $\pi_0$", if $S(X) = \emptyset$, and $d(X,Y)$ at Stage 2 is used only if $S(X) = \{1,2\}$.

Let $\ell: \mathbb{R} \to \mathbb{R}$, with $\ell(0) = 0$, be a non-decreasing function which acts as a loss-gain-function with respect to final decisions 1 and 2. Assume that decision 0 leads neither to a loss nor a gain. Moreover, let $c > 0$ be the costs we have to pay if we wish to perform Stage 2. Finally, let $\tau$ be the prior distribution of the (now random) parameter vector $\Theta$. Then the overall Bayesian risk is given by

0.1)  /  jlc  *  V  .(  n-  vr !«);.;  .  .  I..-  !  j!  u.  [ 

k  i  I 

7 

f  .  i  -  M)  js ; .  ;  t  (  |  • 

i  i  '  °  -  » 

The optimal decision $d^*$ at Stage 2, which minimizes the posterior expected loss given $X = \xi$ and $Y = \eta$, does not depend on the particular choice of the subset selection rule S and turns out to be

(4.2)  $d^*(\xi, \eta) = i$ iff $E\{\ell(\theta_0 - \Theta_i) \mid X = \xi, Y = \eta\} \le \min\{0,\ E\{\ell(\theta_0 - \Theta_j) \mid X = \xi, Y = \eta\}\}$, $\{i,j\} = \{1,2\}$,

and $d^*(\xi, \eta) = 0$ otherwise.

The optimal subset selection rule $S^*$ at Stage 1 (which minimizes the posterior expected loss given $X = \xi$ under the assumption that $d^*$ will be used at Stage 2) decides according to the smallest of the four values given in the following scheme:

(4.3)  $S^*(\xi) = \emptyset$:  0

       $S^*(\xi) = \{i\}$:  $E\{\ell(\theta_0 - \Theta_i) \mid X = \xi\}$,  i = 1, 2,

       $S^*(\xi) = \{1,2\}$:  $c + E\{\min\{0,\ \min_{i=1,2} E\{\ell(\theta_0 - \Theta_i) \mid X = \xi, Y\}\} \mid X = \xi\}$.

Note that in the last expression the inner conditional expectation is viewed as being a function of Y, and that the outer one is the expectation with respect to the conditional distribution of Y given $X = \xi$.

Now let us assume the following normal model: conditionally, given $\Theta = \theta$, X and Y are independent with $X \sim N(\theta, pI)$ and $Y \sim N(\theta, qI)$, and a priori $\Theta \sim N(\theta_0 \mathbf{1}, rI)$, $p, q, r > 0$, $\mathbf{1} = (1,1)$.

Then by using for convenience $U$, $V_1$, $V_2$, which are assumed to be independent standard normals, we get the following scheme equivalent to (4.3):

(4.4)  $S^*(\xi) = \emptyset$:  0

       $S^*(\xi) = \{i\}$:  $E\,\ell\big(r(p+r)^{-1}(\theta_0 - \xi_i) + (pr(p+r)^{-1})^{1/2}\, U\big)$,  i = 1, 2,

       $S^*(\xi) = \{1,2\}$:  $c + E \min\{0,\ \min_{i=1,2} E[\ell(r(p+r)^{-1}(\theta_0 - \xi_i) + \varepsilon V_i + \delta U) \mid V_i]\}$,

where $\varepsilon = pr[(p+r)(pq+pr+qr)]^{-1/2}$ and $\delta = [rpq/(pq+pr+qr)]^{1/2}$.


I 
Let especially $\ell$ be linear, i.e. $\ell(a) = \lambda a$, $a \in \mathbb{R}$, where we can assume without loss of generality that $\lambda = 1$ holds (since this can be compensated by c). Moreover, let us for a moment restrict our consideration to two-stage procedures which at Stage 2 are not permitted to make decision 0. (This corresponds to procedures in $\mathcal{A}$ or $\mathcal{A}'$ with $\alpha_2 = 0$.) Then the optimal procedure, denoted by $d^*$ and $S^*$, can be described in a compact form.

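(4.5)  $d^*(\xi, \eta) = i$ iff $q\xi_i + p\eta_i \ge q\xi_j + p\eta_j$, $\{i,j\} = \{1,2\}$,

and $S^*$ decides according to the smallest of the 4 values given in the following scheme:

(4.6)  $S^*(\xi) = \emptyset$:  0

       $S^*(\xi) = \{i\}$:  $\beta(\theta_0 - \xi_i)$,  i = 1, 2,

       $S^*(\xi) = \{1,2\}$:  $\beta(\theta_0 - \max\{\xi_1, \xi_2\}) + c - 2^{1/2}\varepsilon\, T\big(-\beta(2^{1/2}\varepsilon)^{-1}|\xi_1 - \xi_2|\big)$,

where $\beta = r(p+r)^{-1}$ and $T(y) = \int_{-\infty}^{y} \Phi(x)\,dx$, $y \in \mathbb{R}$.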
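The linear-loss rules can be sketched numerically as follows. The sketch assumes the normal model stated above, under which the posterior mean of $\Theta_i$ given $X = \xi$, $Y = \eta$ is $(pq\theta_0 + qr\xi_i + pr\eta_i)/(pq+pr+qr)$, so that comparing posterior means is the same as comparing the scores $q\xi_i + p\eta_i$. It also uses the closed form $T(y) = y\Phi(y) + \Phi'(y)$ and the four Stage 1 values in the form $0$, $\beta(\theta_0 - \xi_i)$ and $\beta(\theta_0 - \max\xi_i) + c - \sqrt{2}\,\varepsilon\,T(\cdot)$. All function names are hypothetical, and ties are broken in favor of the first candidate, which is a choice made here for illustration.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()


def posterior_mean(xi_i, eta_i, theta0, p, q, r):
    """Posterior mean of Theta_i given both stages under the normal model."""
    return (p * q * theta0 + q * r * xi_i + p * r * eta_i) / (p * q + p * r + q * r)


def stage2_decision_linear(xi, eta, p, q):
    """Stage 2 rule for linear loss without decision 0: returns 1 or 2."""
    score1 = q * xi[0] + p * eta[0]
    score2 = q * xi[1] + p * eta[1]
    return 1 if score1 >= score2 else 2


def T(y):
    """T(y) = integral of Phi from -infinity to y = y * Phi(y) + phi(y)."""
    return y * STD_NORMAL.cdf(y) + STD_NORMAL.pdf(y)


def stage1_rule_linear(xi, theta0, p, q, r, c):
    """Return the subset (as a tuple) with the smallest of the four values."""
    beta = r / (p + r)
    eps = p * r / math.sqrt((p + r) * (p * q + p * r + q * r))
    root2_eps = math.sqrt(2.0) * eps
    both = (beta * (theta0 - max(xi)) + c
            - root2_eps * T(-beta * abs(xi[0] - xi[1]) / root2_eps))
    candidates = [
        (0.0, ()),
        (beta * (theta0 - xi[0]), (1,)),
        (beta * (theta0 - xi[1]), (2,)),
        (both, (1, 2)),
    ]
    return min(candidates, key=lambda t: t[0])[1]
```

For example, with p = q = r = 1 and $\theta_0 = 0$, large costs c keep the procedure from ever reaching Stage 2, while small costs lead to selecting both populations when their Stage 1 means are close together and not far below the control.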

The last expression follows from Lemma 3 in Miescke (1979b). Since T is an increasing function with $T(0) = (2\pi)^{-1/2}$, the procedure will never arrive at Stage 2 if $c \ge \varepsilon\pi^{-1/2}$. But on the other hand, let $c < \varepsilon\pi^{-1/2}$. Then a $\pi_i$ with $\xi_i \ge \xi_j, \theta_0$, $\{i,j\} = \{1,2\}$, will be selected by $S^*$. But now if $\xi_1$ or $\xi_2 \ge \theta_0$ then $S^*(\xi) = \{1,2\}$ if and only if $2^{1/2}\varepsilon\, T\big(-\beta(2^{1/2}\varepsilon)^{-1}|\xi_1 - \xi_2|\big) > c$. Moreover, if $\xi_1, \xi_2 < \theta_0$ there is an area in the neighborhood of $(\theta_0, \theta_0)$ where also $S^*(\xi) = \{1,2\}$ occurs. Thus within $\mathbb{R}^2 \setminus \{\xi \mid \xi_1, \xi_2 < \theta_0\}$, $S^*$ is of the type of Gupta's (1966) maximum means procedure.

If now more generally a decision 0 is also admitted at Stage 2, then the optimal procedure $(S^*, d^*)$ is of similar form but is no longer representable in such a concise manner. Typically, the area where at Stage 1 both populations are selected will be larger.

Finally, let us mention that one gets analogous results if other loss functions are admitted. It is thinkable that especially $\ell(a) = \lambda_1 a$ $(\lambda_2 a)$ if $a > (<)\ 0$, $\lambda_1, \lambda_2 > 0$, leads to a procedure which is closer to that one given at the beginning of this section. But, unfortunately, its representation is more complicated such that this question could be studied only numerically.